Transcription factor search for a DNA promoter in a three-states model 
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To ensure fast gene activation, Transcription Factors (TF) use a mechanism known as facilitated 
diffusion to find their DNA promoter site. Here we analyze such a process where a TF alternates 
between 3D and ID diffusion. In the latter (TF bound to the DNA), the TF further switches between 
a fast translocation state dominated by interaction with the DNA backbone, and a slow examination 
state where interaction with DNA base pairs is predominant. We derive a new formula for the mean 
search time, and show that it is faster and less sensitive to the binding energy fluctuations compared 
to the case of a single sliding state. We find that for an optimal search, the time spent bound to 
the DNA is larger compared to the 3D time in the nucleus, in agreement with recent experimental 
data. Our results further suggest that modifying switching via phosphorylation or methylation of 
the TF or the DNA can efficiently regulate transcription. 



Transcription factors (TFs) are messengers regulating 
gene activation by binding the DNA at specific promoter 
sites. Interestingly, both theoretical and experimental 
evidences show [l45[ that a TF finds rapidly its pro- 
moter site by facilitated diffusion, where it alternates 
between a 3D diffusion inside the nucleus and a ID diffu- 
sion (sliding) along the DNA strand. Facilitated diffusion 
was introduced to resolve the apparent paradox that the 
measured in- vitro association rate of the Lac-I repressor 
with its promoter site placed on A-phage DNA [g] was 
kfi ^ 10^° (Ms)~^, which is ~ 100 times larger than 
the Smoluchowski rate for a pure 3D diffusion search. 
However, the in-vivo mean time r for the Lac repres- 
sor to find its promoter site in E-Coli is around 350s 
0, from which we estimate that the association rate in 
a nucleus with volume \V\ ^ l^m^ is approximated by 
kE = Nav\V\/t - 106(Ms)-i {Nav is the Avogadro 
constant). The difference fee <^ ku is due to a slow 
ID motion [1, such that frequent non-specific bind- 
ings with the DNA in a crowded nucleus slow down the 
search and reduce the association rate. Theoretical anal- 
ysis 0, Q shows that the effective ID diffusion constant 
for sliding along the DNA decays exponentially with the 
variance a of the binding energy distribution between a 
TF and the underlying DNA, and a realistic search time 
can only be achieved for smooth profiles with a < I.S/cbT 
0. However, binding energy estimations for the Cro and 
PurR TF on E. Coli DNA [3, i9j show a much larger 
variance, suggesting that a simple sliding process is not 
sufficient to explain the search dynamics when the TF is 
bound to the DNA. In a more complex model 0,[l3|) sup- 



ported by experimental observations [11[, a TF switches 
between two conformations when bound to the DNA: in 
one state it is insensitive to the underlying DNA sequence 
and diffuses quickly in a smooth energy landscape, while 
in a second state it interacts with the DNA, reducing the 
motion. The impact of such switching has been inves- 
tigated in J^] based on equilibrium considerations. In 
general, switching processes are important because they 
modulate the rate of chemical reactions and lead to in- 
teresting behavior [l3l - [l5| . 



Here we study the mean first passage time (MFPT) for 
a TF to bind to its promoter site when it freely moves 
in the nucleus, but once bound to the DNA, it alter- 
nates between two states (Fig.[T]): in state 1, it interacts 
with individual bp, while in state 2 it is insensitive to 
the underlying bp sequence and interacts with the DNA 
backbone. Therefore, in state 1 motion occurs in a rough 
energy landscape approximated by an effective diffusion 
with a slow diffusion constant Di, while in state 2 dif- 
fusion is faster {D2 3> -Di) and occurs in a smooth po- 
tential well generated by the interaction with the DNA 
backbone. The translocations in state 2 are comparable 
to 'hoppings' along the DNA. The switching dynamics 
is Poissonian with rates ki2 and ^21 that depend on the 
energy profile (Fig.[TjD). In general, the binding time ki2 
depends on the DNA sequence and therefore on the posi- 
tion along the DNA, however, in first approximation, we 
use a constant value. In state 2, in addition to switching 
to state 1, the TF can detach from the DNA with rate 
^23 and switch to state 3, where it diffuses in the nucleus 
before reattaching in state 2 after an average time fc^^. 



investigated in [2|, Il6l - ll8| . Due to the packed and coiled 
DNA conformation, we approximate the TF reattach- 
ment locations as uncorrelated and randomly distributed 
along the DNA 0,[ii-l21| . We derive a new expression for 
the MFPT to find a promoter site (eq. [3]), and we show 
that 1) this time is not very sensitive to binding energy 
fiuctuations, contrary to previous models with a single 
sliding state, and 2) an optimal search process (eq.|l]) pro- 
ceeds such that a TF spends more time bound the DNA 
compared to freely diffusing in the nucleus, in agreement 
with recent experiments 

We start the analysis by considering diffusion along 
the DNA in the ID interval < x < L (a; is the DNA 
contour length) with switching between state 1 and 2. 
The target is located at x = and can only be found 
in state 1. To derive an expression for the MFPT, we 
use the sojourn times i„m(a;) a particle spends in state 
n (n = 1,2,3) when it started in state m = 1,2 at a 
DNA position x. Because a TF attaches to the DNA at 
a random position x, when starting the search in state 
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FIG. 1: (a) Search scenario with 3 states, (b) Energy profile 
and switching rates between states. 

3, the sojourn times do not depend on the initial posi- 
tion, and we have i„3 = r„3 = const. The times r„3 are 
related to the spatially averaged sojourn times Tnm = 
L^^ tnm{x)dx. Considering that a TF can only bind 
to the target in state 1, we have the relations T13 — T12, 
T23 = fci2/A;2iTi3 + 1/^21 and r33 = ^23/^327-23 + 1/^32- 
The coupled system of equations describing tii(x) and 
ii2(a;) is (we suppress the x dependency) 



i^2ii2-fc2l(il2-tll)-fc23(tl2-Ti2) = 



(1) 



with boundary conditions tii(O) = — ^12(0) — 

t'i2{L) = 0. The remaining sojourn times t2m{x) and 
hm{x) are t2ra{x) = ki2/k2itirn{x) + ^21^(1 " ^mi) and 
hm{x) = ^23/^32*2™ (a;). By integrating eq. [Uwe further 
obtain the intuitive relation rn = T12. Hence, start- 
ing initially uniformly distributed in state m, the MFPT 
r(m) = Tirn + T2m + T^m Can bc expressed in terms 
of Til only. In particular, starting in state 1, we have 
r(l) = Tii(l + fci2/fc2i + ^12/023/(^21^32))- 

Using the variables x — x/L, I12 = ^12/ [L'^Di), 
hi = k2i/{L'^D2) and ^23 = ^23/(^^-^2), and the func- 
tions vi{x) — ki2Tii{x) and ^2(2;) ~ ki2Ti2{x) {vi is the 
mean number of switchings between state 1 and 2), the 
solutions of eq. [T] are 



vi{x)\ _ hi ( cosh(\/^L2M2(l - ^)) 
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where ^2 = \/(l + (^21 + ^23)/^i2)^ - 4^23/^12, = 

-1 + (^21 +/23)/^12, = 1 + (a -6)/2, Mi - H 

(ei+6)/2 and ej = (^12(6 + 6)/(2/20, 1), ej = 
(^12(6 - 6)/(2^2i), !)• The average wi = vi{x)dx is 
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Because 61 61 Mi and M2 are all independent of i, vi 
depends on L only via /12. The relevant physical param- 
eters are L, fci2, fei, ^3, ^32, Di and Z)2- However, to 
facilitate our further discussion, we shall now character- 



ize the rates fci2, ^21 and ^23 by the detaching probability 
q = ^23/(^1 + ^23) to switch from state 2 to 3 (p = 1 — g 
is the probability to switch from state 2 to 1) and the 
lengths hi = \jD1lk12 and ls2 = \J^2l(kii + ^23), cor- 
responding to the average sliding distances in state 1 and 
2 before switching. The spatially averaged search time 
T « r(l) is 
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Before detaching and switching to state 3, a TF stays 
bound to the DNA for an average time tdna — ^23^ + 
ky2p/q, and the overall ratio of the mean time bound to 
the DNA to the mean time spent in state 3 is 
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When switching between state 1 and 2 is fast and diffu- 
sion in state 1 is negligible compared to state 2 {Di <C 
D2), then the diffusion constant with which a TF appears 
to slide along the DNA is 
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(5) 



When the parameters L, D2 and ^32 are given, we 
shall now study how the search process depends on hii 
h2, Q and Di. Modulating these parameters can be a 
way to regulate gene expression. Because a TF moves 
in state 2 in a smooth potential, we consider that D2 
is comparable the 3D diffusion constant. In contrast, 
in state 1, the TF interacts with individual bp and the 
effective diffusion constant is much reduced and can be 
written as Di = D2e~^, where x > depends on the 
binding energy profile. For a single sliding state, x 
related to the variance of the binding energy ^, . In 
general, x depends on the DNA sequences and therefore 
on the position along the DNA, however, we consider a 
constant average value here. We later on show that the 
search is not much sensitive to x variations in x, as long as 
X is not too large. We now proceed with the asymptotic 
analysis in the regime where k, — II1/II2 ^ 1 and q <C 
1. The condition n <^ \ avoids a redundant search in 
state 1 where diffusion is slow. As long as switching 
between state 1 and 2 is fast compared to the time k^2 
spent in state 3, the limit q <C 1 avoids too frequent 
detaching from the DNA that would increase the search 
time. Under the condition that k <C 1 and q ^ 1, we 
have the asymptotic 6 ~ ^ (1 ^ '*)i 6 ~ 1 + «^(1 ~ 2g), 
Ml ~ Kg, /i^ « 1 -I- K and vi « L/hi{i + \/n/<l)- Using 
these expressions in eq. [3] and eq. |4l we find 



L2 



D2k 



2K32 



q J \ a an 



aq 



(6) 



3 



where a = \/ D2 / {l^ik32) ■ When a and x are fixed, the 
minimum of T{q, k) as a function of k and q is achieved 

for (Kmm, q,mri) = {\/ 2 / (ae^^) , K^]„) , and 

r™„ = 1 + ^2ex/a. (9) 

For e^/a <C 1, the asymptotic expansion is Tmin ~ 
2^L'^/{D2k32){i + \/2e^/a), showing that Tmin does not 
depend exponentially on x in that regime. We now com- 
pare our results with the ones for a single sliding state: 
when a TF alternates only between state 1 and 3 with 
rates fcia and fcai (the intermediate state 2 is absent), 
we find from eq. [2] that vi = ^/Ti3 = L"^ / {Diki^), 
and for the search time we recover the expression f = 
./UJiD^k^ik^i + Ki) S 0, H0_When fcsi is 
fixed, the minimum fmin = 2^ L'^ / [Dik^i) is achieved 
for kis = fcsi, and r is always one at the minimum with 
a single sliding state, which is not any longer the case in 
the two states sliding model. 

We now proceed with some numerical estimations us- 
ing parameters for E.coli bacteria: L — 2.4 x lO^bp (half 
the size of E. coli DNA, to compensate that the target is 
located at the boundary in our analysis), fc32 = (1.4/is)~^ 
[sl. [l6l| and D2 — 2^nT?, comparable to the 3D diffusion 
constant In Fig. [2^, we plot the minimum of r as a 
function of x a-nd for various Isi = (0.5, 1,3,5) (in units 
of bp). The case I si — 0.5 can be considered as an effec- 
tive description of a physical search process where a TF 
is bound and immobile in state 1 (similar to the scenario 
considered in pjj): after switching back to state 2, the 
TF position in state 2 has changed only slightly in the 
range of a single bp (the average of the maximum diffu- 
sion length in state 1 is 2lsi — 2^ D\lk\2 = 1). This 
position change can also be interpreted as the variability 
due to the unbinding process. The mean binding time 
^12 depends on the energy barrier Ai? (in units of ksT) 
separating state 1 from 2. Comparing the Arrhenius for- 
mula ki2 = ^e""^^, where ^ is an effective prefactor, 
with ki2 = Di/P^^ = Dae^^/Z^i, we identify x ^ 
and ^ — 02/1^1- Hence, for Igi small, the parameter x is 
the binding energy, however, for large Isi, X is related to 
the variance of the binding energy landscape in state 1, 
as described in [tI. [sl. [let . 

Fig. [2^ shows that Tmm is initially not very sensitive 
to X until X ^ Ina (for Isi = 0.5 we have In a ^ 6). 
In contrast, with a single sliding state the minimum 
T,nin — 2y/ L'^ / {D2k32)e^^^ (with ksi = ^32) increases ex- 
ponentially with X E^nd quickly reaches much higher val- 
ues (black dashed curve in Fig. [5^). Furthermore, within 
the two states sliding model, the novel feature is that 




FIG. 2; Optimal search process. Quantities are plotted as a 
function of x and for various length hi. The parameter values 
are L = 2.4 x 10®bp, ^32 = (1.4ms)~\ D2 = 2^tm^s"^ and 
Di — D2e~^. The dashed curve in (a) corresponds to the 
minimal time with a single sliding state. Tmin is given in sec. 

the time ratio Tmin at the minimum is not constant but 
increases with x (Fig-ISt)- As a consequence, the experi- 
mental findings that a TF spends more time bound to the 
DNA compared to diffusing inside the nucleus [HI is now 
compatible with an optimal search process. For example, 
for Isi = 0.5, the experimental results r^xp ~ 350s and 
Texp ~ 5 [5| are compatible with a value x ~ 8 (Fig. [2^- 
b). Because diffusion in state 1 slows down as x rises, 
the sliding distance ls2,min and the probability qmin to 
switch from state 2 to state 3 increase, thereby reduc- 
ing the probability of recurrently visiting the same DNA 
site in state 1 (Fig. [2};-d). Surprisingly, a larger detach- 
ing probability qmin does not lead to a higher fraction 
of time spent in state 3, which is counter intuitive {vmin 
increases. Fig. [2}d). To study the impact of increasing 
the binding strength in state 1, while the motion in state 
2 (interaction with DNA backbone) is not affected, we 
plotted r as a function of x (Fig. |3^-b) for I si — 0.5 and 
various ^^2 and q that are independent of x- This is in 
contrast with Fig. [21 where Tmin is achieved for values of 
qmin and ls2,min that do depend on x and Isi- In Fig.[3t, 
we plot the apparent diffusion constant Da (sliding along 
the DNA) as a function of x, with parameters associated 
with panel a. Da decreases as x increases, and for x ~ 7, 
we have Da ~ OA^ni?'s~^ , which is similar to measure- 
ments Within a single sliding state model, the ID dif- 
fusion coefficient Di — D2er^l^ decreases much faster as 
function of x compared to Da (dashed line in Fig. [3]:). We 
conclude that experimental measurements of the appar- 
ent diffusion constant are compatible with much stronger 
binding energies in a two-state compared to a single state 
model. Finally, we show how r is modulated by varying 
qor/,2 for X- 7.5 (Fig.[3H). 

To conclude, we showed here that the TF search time, 
characterized by switching between two states on the 
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3D is approximately exponentially distributed, both of 
which are widely used and accepted in the literature. 
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FIG. 3: (a) Search time r (in sec) for Zsi = 0.5 and q — 0.01. 
The dashed Une is the minimum of t. (b) r for hi = 0.5 and 
I32 = 10. (c) Apparent diffusion constant Da scaled by D2 for 
the situation in (a). The dashed hne is the diffusion constant 
D1/D2 = e~^^^ for a model with a single sliding state, (d) 
Search time r for x = 7.5 showing a minimum. 

DNA, is considerably faster and less sensitive to bind- 
ing energy fluctuations compared to a single ID sliding 
state. Performing fast translocations ('hoppings') of the 
order of lObp in state 2 speeds up the search time by 
reducing a slow recurrent search in state 1. In our anal- 
ysis, switching between state 1 and 2 is a common and 
necessary feature of the search mechanism, in contrast 
to scenarios, where it is induced at strong DNA binding 
sites f^. State 2 further offers the possibility that a TF 
moves along the DNA by translation without the need 
to follow the double-helix rotation. Furthermore, since 
DNA promoter sequences are usually > 10bps and even 
present in several copies 24 , 2^ , small translocations in 
state 2 are unlikely to overshoot the target region. We 
show that an optimal search in our switching model in- 
volves a larger time spent bound to the DNA compared 
to diffusing in the nucleus, in agreement with experimen- 
tal findings [i]. Finally, we find that the search time is 
very sensitive to changes in the detaching probability q. 
Hence, changing the TF interaction with the DNA back- 
bone via modifying the electrical properties of the TF 
or the DNA by phosphorylation, methylation or acetyla- 
tion is an efficient way to modulate the search time, and 
ultimately the cellular response. Future works should 
clarify the impact of the binding energy fluctuations in 
state 1, and should analyze in details the 3D dynamics, 
for example by considering DNA coiling [la. Moreover, 
in eukaryotes, the compact DNA structure [26| and pos- 
sible nuclear transport mechanism (27j might as well be 
critical. Nevertheless, we expect that our results derived 
here remain a good approximation as long as subsequent 
attaching positions to the DNA are well separated com- 
pared to the average distance a TF slides along the DNA 
before detaching (around lOObp), and the time spent in 
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To ensure fast gene activation, Transcription Factors (TF) use a mechanism known as facilitated 
diffusion to find their DNA promoter site. Here we analyze such a process where a TF alternates 
between 3D and ID diffusion. In the latter (TF bound to the DNA), the TF further switches between 
a fast translocation state dominated by interaction with the DNA backbone, and a slow examination 
state where interaction with DNA base pairs is predominant. We derive a new formula for the mean 
search time, and show that it is faster and less sensitive to the binding energy fluctuations compared 
to the case of a single sliding state. We find that for an optimal search, the time spent bound to 
the DNA is larger compared to the 3D time in the nucleus, in agreement with recent experimental 
data. Our results further suggest that modifying switching via phosphorylation or methylation of 
the TF or the DNA can efficiently regulate transcription. 



Transcription factors (TFs) are messengers regulating 
gene activation by binding the DNA at specific promoter 
sites. Interestingly, both theoretical and experimental 
evidences show [?????] that a TF finds rapidly 
its promoter site by facilitated diffusion, where it alter- 
nates between a 3D diffusion inside the nucleus and a 
ID diffusion (sliding) along the DNA strand. Facilitated 
diffusion was introduced to resolve the apparent para- 
dox that the measured in-vitro association rate of the 
Lac-I repressor with its promoter site placed on A-phage 
DNA [? ] was kR ~ 10^° (M s)-\ which is ~ 100 times 
larger than the Smoluchowski rate for a pure 3D diffusion 
search. However, the in-vivo mean time r for the Lac re- 
pressor to find its promoter site in E-Coli is around 350s 
[? ], from which we estimate that the association rate 
in a nucleus with volume \V\ ~ l^m^ is approximated 
by kE = Nav\V\/t - 10'^(Ms)-i {Nav is the Avogadro 
constant). The difference ks <C ka is due to a slow ID 
motion [? ? ], such that frequent non-specific bindings 
with the DNA in a crowded nucleus slow down the search 
and reduce the association rate. Theoretical analysis [? ? 
] shows that the effective ID diffusion constant for sliding 
along the DNA decays exponentially with the variance a 
of the binding energy distribution between a TF and the 
underlying DNA, and a realistic search time can only be 
achieved for smooth profiles with a < l.SfcsT [? ]. How- 
ever, binding energy estimations for the Cro and PurR 
TF on E. Coli DNA [? ? ] show a much larger variance, 
suggesting that a simple sliding process is not sufficient 
to explain the search dynamics when the TF is bound to 
the DNA. In a more complex model [? ? ], supported by 
experimental observations [? ], a TF switches between 
two conformations when bound to the DNA: in one state 
it is insensitive to the underlying DNA sequence and dif- 
fuses quickly in a smooth energy landscape, while in a 
second state it interacts with the DNA, reducing the mo- 
tion. The impact of such switching has been investigated 
in [? ] based on equilibrium considerations. In general, 
switching processes are important because they modu- 
late the rate of chemical reactions and lead to interesting 
behavior [? ? ? ]. 



Here we study the mean first passage time (MFPT) for 
a TF to bind to its promoter site when it freely moves 
in the nucleus, but once bound to the DNA, it alter- 
nates between two states (Fig. [1]): in state 1, it interacts 
with individual bp, while in state 2 it is insensitive to 
the underlying bp sequence and interacts with the DNA 
backbone. Therefore, in state 1 motion occurs in a rough 
energy landscape approximated by an effective diffusion 
with a slow diffusion constant Di, while in state 2 dif- 
fusion is faster (D2 ^ Di) and occurs in a smooth po- 
tential well generated by the interaction with the DNA 
backbone. The translocations in state 2 are comparable 
to 'hoppings' along the DNA. The switching dynamics 
is Poissonian with rates ki2 and ^21 that depend on the 
energy profile (Fig. ^p) . In general, the binding time ki2 
depends on the DNA sequence and therefore on the posi- 
tion along the DNA, however, in first approximation, we 
use a constant value. In state 2, in addition to switching 
to state 1, the TF can detach from the DNA with rate 
A:23 and switch to state 3, where it diffuses in the nucleus 
before reattaching in state 2 after an average time k^2 ^ 
investigated in [? ? ? ? ]. Due to the packed and coiled 
DNA conformation, we approximate the TF reattach- 
ment locations as uncorrelated and randomly distributed 
along the DNA [????]. We derive a new expres- 
sion for the MFPT to find a promoter site (eq. [3]), and 
we show that 1) this time is not very sensitive to bind- 
ing energy fluctuations, contrary to previous models with 
a single shding state, and 2) an optimal search process 
(eq. SI proceeds such that a TF spends more time bound 
the DNA compared to freely diffusing in the nucleus, in 
agreement with recent experiments [? ]. 

We start the analysis by considering diffusion along 
the DNA in the ID interval < x < L (x is the DNA 
contour length) with switching between state 1 and 2. 
The target is located at x = and can only be found 
in state 1. To derive an expression for the MFPT, we 
use the sojourn times t„m(x) a particle spends in state 
n [n — 1,2,3) when it started in state m = 1,2 at a 
DNA position x. Because a TF attaches to the DNA at 
a random position x, when starting the search in state 
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FIG. 1: (a) Search scenario with 3 states, (b) Energy profile 
and switching rates between states. 

3, the sojourn times do not depend on the initial posi- 
tion, and we have i„3 = r„3 = const. The times r„3 are 
related to the spatially averaged sojourn times Tnm = 
L^^ tnm{x)dx. Considering that a TF can only bind 
to the target in state 1, we have the relations T13 — T12, 
T23 = ki2/k2iTi3 + 1/^21 and r33 = ^23/^327-23 + 1/^32- 
The coupled system of equations describing tii(x) and 
ii2(x) is [? ] (we suppress the x dependency) 



Ciii'i -fci2(iii -ti2) = -1 

i^2ii2-fc2l(il2-tll)-fc23(tl2-Ti2) = 



(1) 



with boundary conditions tii(O) = — ^12(0) — 

t'i2{L) = 0. The remaining sojourn times t2m{x) and 
hm{x) are t2ra{x) = ki2/k2itirn{x) + ^21^(1 " ^mi) and 
hm{x) = ^23/^32*2™ (a;). By integrating eq. [Uwe further 
obtain the intuitive relation rn = T12. Hence, start- 
ing initially uniformly distributed in state m, the MFPT 
r(m) = Tirn + T2m + T^m Can bc expressed in terms 
of Til only. In particular, starting in state 1, we have 
r(l) = Tii(l + fci2/fc2i + ^12/023/(^21^32))- 

Using the variables x — x/L, I12 = ki2/{L'^Di), 
hi = k2i/{L'^D2) and ^23 = ^23/(^^-^2), and the func- 
tions vi{x) — ki2Tii{x) and ^2(2;) ~ ki2Ti2{x) {vi is the 
mean number of switchings between state 1 and 2), the 
solutions of eq. [T] are 



vi{x)\ _ hi ( cosh(\/^L2M2(l - ^)) 

V2{x) 



1 



C2 V \/Sm2 sinh( \/Ii2/i2 
hi_ f cosh(Vr^^i(l - x)) ^ 1 
6 Vv^Mi smh(v^Aii) ^12/^1 



^12A*2 
6*1 -t- Vi 
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where ^2 = \/(l + (^21 + ^23)/^i2)^ - 4^23/^12, = 

-1 + (^21 +/23)/^12, = 1 + (a -6)/2, Mi - H 

(ei+6)/2 and ej = (^12(6 + 6)/(2/20, 1), ej = 
(^12(6 - 6)/(2^2i), !)• The average wi = vi{x)dx is 



g2-a 

26 
26 



coth(vI^^2) 1 

tl2 o 

M2 M2 

C0th(V^L2Ml) 1 



Ml 



M? 



(2) 



Because ^1, £,2, Mi and M2 are all independent of i, vi 
depends on L only via /12. The relevant physical param- 
eters are L, fci2, fei, ^3, ^32, Di and Z)2- However, to 
facilitate our further discussion, we shall now character- 



ize the rates fci2, ^21 and ^23 by the detaching probability 
q = ^23/(^1 + ^23) to switch from state 2 to 3 (p = 1 — g 
is the probability to switch from state 2 to 1) and the 
lengths hi = \/Di/ki2 and h2 = \J^2l(k2i + ^23), cor- 
responding to the average sliding distances in state 1 and 
2 before switching. The spatially averaged search time 
T « r(l) is 
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Before detaching and switching to state 3, a TF stays 
bound to the DNA for an average time tdna — ^23^ + 
ky2p/q, and the overall ratio of the mean time bound to 
the DNA to the mean time spent in state 3 is 
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When switching between state 1 and 2 is fast and diffu- 
sion in state 1 is negligible compared to state 2 {Di <C 
D2), then the diffusion constant with which a TF appears 
to slide along the DNA is 



Da 



Do 



Do 



l + fel/fcl2 l+pD2llJ{Dill2) 



(5) 



When the parameters L, D2 and ^32 are given, we 
shall now study how the search process depends on hii 
h2, Q and Di. Modulating these parameters can be a 
way to regulate gene expression. Because a TF moves 
in state 2 in a smooth potential, we consider that D2 
is comparable the 3D diffusion constant. In contrast, 
in state 1, the TF interacts with individual bp and the 
effective diffusion constant is much reduced and can be 
written as Di = D2e~^, where x > depends on the 
binding energy profile. For a single sliding state, x 
related to the variance of the binding energy [? ? ] . In 
general, x depends on the DNA sequences and therefore 
on the position along the DNA, however, we consider a 
constant average value here. We later on show that the 
search is not much sensitive to x variations in x, as long as 
X is not too large. We now proceed with the asymptotic 
analysis in the regime where k, — II1/II2 ^ 1 and q <C 
1. The condition n <^ \ avoids a redundant search in 
state 1 where diffusion is slow. As long as switching 
between state 1 and 2 is fast compared to the time k^2 
spent in state 3, the limit q <C 1 avoids too frequent 
detaching from the DNA that would increase the search 
time. Under the condition that k <C 1 and q ^ 1, we 
have the asymptotic 6 ~ ^ (1 ^ '*)i 6 ~ 1 + '^(1 ^ 2^), 
Ml ~ Kg, /ii « 1 -I- K and vi « L/hi{^ + \/n/<l)- Using 
these expressions in eq. [3] and eq. |4l we find 
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where a = \/ D2 / {l^ik32) ■ When a and x are fixed, the 
minimum of T{q, k) as a function of k and q is achieved 

for (Kmm, q,mri) = {\/ 2 / (ae^^) , K^]„) , and 

r™„ = 1 + ^2ex/a. (9) 

For e^/a <C 1, the asymptotic expansion is Tmin ~ 
2^L'^/{D2k32){i + \/2e^/a), showing that Tmin does not 
depend exponentially on x in that regime. We now com- 
pare our results with the ones for a single sliding state: 
when a TF alternates only between state 1 and 3 with 
rates fcia and fcai (the intermediate state 2 is absent), 
we find from eq. [2] that vi — y^Tis = L'^ /{Diki^), 
and for the search time we recover the expression f — 
./UJiD^k^ik^i + Ki) [? ? ? ? ]. W hen fcai is 
fixed, the minimum f,„i„ — 2y^L^ /(Diksi) is achieved 
for ki3 = ksi, and r is always one at the minimum with 
a single sliding state, which is not any longer the case in 
the two states sliding model. 

We now proceed with some numerical estimations us- 
ing parameters for E.coli bacteria: L — 2.4 x lO^bp (half 
the size of E. coli DNA, to compensate that the target is 
located at the boundary in our analysis), ^32 = (1.4/is)~^ 
[? ? ] and D2 = 211111? , comparable to the 3D diffusion 
constant [? ] . In Fig. [2^, we plot the minimum of r as a 
function of x a-nd for various Isi = (0.5, 1,3,5) (in units 
of bp). The case = 0.5 can be considered as an effec- 
tive description of a physical search process where a TF 
is bound and immobile in state 1 (similar to the scenario 
considered in [? ]): after switching back to state 2, the 
TF position in state 2 has changed only slightly in the 
range of a single bp (the average of the maximum diffu- 
sion length in state 1 is 2lsi — 2^ D\lk\2 = 1). This 
position change can also be interpreted as the variability 
due to the unbinding process. The mean binding time 
^12 depends on the energy barrier Ai? (in units of ksT) 
separating state 1 from 2. Comparing the Arrhenius for- 
mula ki2 = ^e""^^, where ^ is an effective prefactor, 
with ki2 = Di/P^^ = Dae^^/Z^i, we identify x ^ 
and ^ — 02/1^1- Hence, for Igi small, the parameter x is 
the binding energy, however, for large Isi, X is related to 
the variance of the binding energy landscape in state 1, 
as described in [? ? ? ]. 

Fig. shows that T„iin is initially not very sensitive 
to X until X ^ Ina (for Isi = 0.5 we have In a ^ 6). 
In contrast, with a single sliding state the minimum 
T,nin — 2y/ L'^ / {D2k32)e^^^ (with ksi = ^32) increases ex- 
ponentially with X E^nd quickly reaches much higher val- 
ues (black dashed curve in Fig. [5^). Furthermore, within 
the two states sliding model, the novel feature is that 




FIG. 2; Optimal search process. Quantities are plotted as a 
function of x and for various length hi. The parameter values 
are L = 2.4 x 10®bp, ^32 = (1.4ms)~\ D2 = 2^tm^s"^ and 
Di — D2e~^. The dashed curve in (a) corresponds to the 
minimal time with a single sliding state. Tmin is given in sec. 

the time ratio Tmin at the minimum is not constant but 
increases with x (Fig-ISt)- As a consequence, the experi- 
mental findings that a TF spends more time bound to the 
DNA compared to diffusing inside the nucleus [? ] is now 
compatible with an optimal search process. For example, 
for Isi — 0.5, the experimental results r^xp ~ 350s and 
Texp ~ 5 ] are compatible with a value x ~ 8 (Fig. [2^- 
b). Because diffusion in state 1 slows down as x rises, 
the sliding distance ls2,min and the probability qmin to 
switch from state 2 to state 3 increase, thereby reduc- 
ing the probability of recurrently visiting the same DNA 
site in state 1 (Fig. [2j:-d). Surprisingly, a larger detach- 
ing probability qmin does not lead to a higher fraction 
of time spent in state 3, which is counter intuitive {vmin 
increases. Fig. [2}d). To study the impact of increasing 
the binding strength in state 1, while the motion in state 
2 (interaction with DNA backbone) is not affected, we 
plotted r as a function of x (Fig. |3^-b) for I si — 0.5 and 
various ^^2 and q that are independent of x- This is in 
contrast with Fig. [21 where Tmin is achieved for values of 
qmin and ls2,min that do depend on x and Isi- In Fig.[3t, 
we plot the apparent diffusion constant Da (sliding along 
the DNA) as a function of x, with parameters associated 
with panel a. Da decreases as x increases, and for x ~ 7, 
we have Da ^ OA^'m^s~^ , which is similar to measure- 
ments [? ]. Within a single sliding state model, the 
ID diffusion coefficient Di — D2e^^/^ decreases much 
faster as function of x compared to Da (dashed line in 
Fig. [Ht). We conclude that experimental measurements 
of the apparent diffusion constant are compatible with 
much stronger binding energies in a two-state compared 
to a single state model. Finally, we show how r is mod- 
ulated by varying q or ls2 for x = 7.5 (Fig. [3jl). 

To conclude, we showed here that the TF search time, 
characterized by switching between two states on the 
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FIG. 3: (a) Search time r (in sec) for Zsi = 0.5 and q — 0.01. 
The dashed line is the minimum of t. (b) r for hi = 0.5 and 
I32 = 10. (c) Apparent diffusion constant Da scaled by D2 for 
the situation in (a). The dashed line is the diffusion constant 
D1/D2 = e~^^^ for a model with a single sliding state, (d) 
Search time r for x = 7.5 showing a minimum. 

DNA, is considerably faster and less sensitive to bind- 
ing energy fluctuations compared to a single ID sliding 
state. Performing fast translocations ('hoppings') of the 
order of lObp in state 2 speeds up the search time by 
reducing a slow recurrent search in state 1. In our anal- 
ysis, switching between state 1 and 2 is a common and 
necessary feature of the search mechanism, in contrast 
to scenarios, where it is induced at strong DNA binding 
sites [? ] . State 2 further offers the possibility that a TF 



moves along the DNA by translation without the need 
to follow the double-helix rotation. Furthermore, since 
DNA promoter sequences are usually > 10bps and even 
present in several copies [? ? ], small translocations in 
state 2 are unlikely to overshoot the target region. We 
show that an optimal search in our switching model in- 
volves a larger time spent bound to the DNA compared 
to diffusing in the nucleus, in agreement with experimen- 
tal findings [? ] . Finally, we find that the search time is 
very sensitive to changes in the detaching probability q. 
Hence, changing the TF interaction with the DNA back- 
bone via modifying the electrical properties of the TF or 
the DNA by phosphorylation, methylation or acetylation 
is an efficient way to modulate the search time, and ulti- 
mately the cellular response. Future works should clarify 
the impact of the binding energy fiuctuations in state 1, 
and should analyze in details the 3D dynamics, for ex- 
ample by considering DNA coiling [? ]. Moreover, in 
eukaryotes, the compact DNA structure [? ] and possi- 
ble nuclear transport mechanism [? ] might as well be 
critical. Nevertheless, we expect that our results derived 
here remain a good approximation as long as subsequent 
attaching positions to the DNA are well separated com- 
pared to the average distance a TF slides along the DNA 
before detaching (around lOObp), and the time spent in 
3D is approximately exponentially distributed, both of 
which are widely used and accepted in the literature. 
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